Subpolynomial trace reconstruction for random strings and arbitrary deletion probability
نویسندگان
چکیده
The deletion-insertion channel takes as input a bit string x ∈ {0, 1}, and outputs a string where bits have been deleted and inserted independently at random. The trace reconstruction problem is to recover x from many independent outputs (called “traces”) of the deletion-insertion channel applied to x. We show that if x is chosen uniformly at random, then exp(O(log n)) traces suffice to reconstruct x with high probability. The earlier upper bounds were exp(O(log n)) for the deletion channel with deletion probability less than 1/2, and exp(O(n)) for the general case. A key ingredient in our proof is a two-step alignment procedure where we estimate the location in each trace corresponding to a given bit of x. The alignment is done by viewing the strings as random walks, and comparing the increments in the walk associated with the input string and the trace, respectively.
منابع مشابه
Study of Random Biased d-ary Tries Model
Tries are the most popular data structure on strings. We can construct d-ary tries by using strings over an alphabet leading to d-ary tries. Throughout the paper we assume that strings stored in trie are generated by an appropriate memory less source. In this paper, with a special combinatorial approach we extend their analysis for average profiles to d-ary tries. We use this combinatorial appr...
متن کاملTrace Reconstruction Revisited
The trace reconstruction problem is to reconstruct a string x of length n given m random subsequences where each subsequence is generated by deleting each character of x independently with probability p. Two natural questions are a) how large must m be as a function of n and p such that reconstruction is possible with high probability and b) how can this reconstruction be performed efficiently....
متن کاملTrace reconstruction with varying deletion probabilities
In the trace reconstruction problem an unknown string x = (x0, . . . , xn−1) ∈ {0, 1, ...,m − 1} is observed through the deletion channel, which deletes each xk with a certain probability, yielding a contracted string X̃. Earlier works have proved that if each xk is deleted with the same probability q ∈ [0, 1), then exp(O(n)) independent copies of the contracted string X̃ suffice to reconstruct x...
متن کاملSymbolic Channel Modelling for Noisy Channels Which Permit Arbitrary Noise Distributions
In this paper we present a new model for noisy channels which permit arbitrarily distributed substitution, deletion and insertion errors. Apart from its straightforward applications in string generation and recognition, the model also has potential applications in speech and unidimensional signal processing. The model is specified in terms of a noisy string generation technique. Let A be any fi...
متن کاملSecond Moment of Queue Size with Stationary Arrival Processes and Arbitrary Queue Discipline
In this paper we consider a queuing system in which the service times of customers are independent and identically distributed random variables, the arrival process is stationary and has the property of orderliness, and the queue discipline is arbitrary. For this queuing system we obtain the steady state second moment of the queue size in terms of the stationary waiting time distribution of a s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1801.04783 شماره
صفحات -
تاریخ انتشار 2018